Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·13h
🪄Prompt Engineering
Cloud CISO Perspectives: AI as a strategic imperative to manage risk
cloud.google.com·6h
🆕New AI
GenAI Poisoning: How Fewer Than 100 Samples Can Corrupt a Multi-Billion Parameter Model
pub.towardsai.net·7h
🛡️AI Security
Anthropic's Pilot Sabotage Risk Report
🛡️Anthropic PBC
MIT’s Survey On Accelerators and Processors for Inference, With Peak Performance And Power Comparisons
semiengineering.com·5h
🏗️LLM Infrastructure
Academic Integrity Working Group addresses generative AI and exam policies
news.stanford.edu·22h
🛡️AI Security
Project-MONAI/MONAI
github.com·21h
🔎Meilisearch
AI coding is moving faster than the guardrails meant to secure it and that's risky business.
🛡️AI Security
Too much social media gives AI chatbots ‘brain rot’
nature.com·11h
🛡️AI Security
Stop Writing Code, Start Writing Docs
thenewstack.io·3h
🪄Prompt Engineering
🧠🚀 Excited to introduce Supervised Reinforcement Learning—a framework that leverages expert trajectories to teach small LMs how to reason through hard problems ...
threadreaderapp.com·20h
🏗️LLM Infrastructure
Social media feeds 'misaligned' when viewed through AI safety framework
📊Feed Optimization
Waymo accused of hitting cat shows why AI needs to be perfect
semafor.com·5h
🆕New AI
Study: AI Models Trained On Clickbait Slop Result In AI ‘Brain Rot,’ ‘Hostility’
🛡️Content Moderation
Big Tech’s market dominance is becoming ever more extreme
ft.com·3h
🖥GPUs
Why It’s Time to Sunset the Turing Test
cacm.acm.org·4h
🛡️Anthropic PBC
OpenAI launches Aardvark to detect and patch hidden bugs in code
infoworld.com·10h
🔓Open Source Software
The AI Buildout Is So Big Even a Haunted House Owner Wants in
bloomberg.com·12h
🆕New AI